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Growing experimental evidence shows that proteins follow one or a few distinct paths when 
folding. We propose in this paper a procedure to parametrize these observed pathways, and from 
this parametrization construct effective Hamiltonians for the proteins. We furthermore study the 
dcnaturated-native transitions for a wide class of possible effective Hamiltonians based on this 
scheme, and find that the sharpness (tightness) of the transitions typically are close to their the- 
oretical maximum and thus in quantitative accordance with the sharp folding transition observed 
for single domain proteins. Finally we demonstrate that realistic folding times are typical for the 
, proposed class of Hamiltonians, and we discuss the implication of the predicted entropy barriers on 

^— ^ ' the temperature dependence of the folding times. 

PACS: 05.70.Jk,82.20.Db,87.15.By,87.10.+e 

I—* 

o 



c3 



I. INTRODUCTION 



Globular proteins exists in two states the native, or folded, state and the denaturated, or unfolded, state |Q. The 

■ folding of a protein is mostly driven by thermodynamics, and it typically occurs on a time scale of order seconds. 
The folding, and the resulting conformations result from an interplay between polar and nonpolar amino acids on 
the protein, their hydrogen bonds, and how all these interact with and constrain the surrounding water. The folding 
is therefore an immensely complex process and to date it is still a completely open problem how one may predict a 

• protein's folding properties given its structure. 

On a qualitative level it is remarkable that the native state of a globular protein is solid with specified positions 
of all amino acids. This fact is thermodynamically reflected in a denaturation phase transition which is exceedingly 
sharp, in spite of the small number of molecules in the protein. Enhancing this qualitatively remarkable phenomenon 
is the fact that a typical polymer folds with a very weak transition (2|. In fact single domain protein folding transition 
fairly accurately behaves as a two-state thermodynamical system ||| where only denatured and native state can be 

■ populated and nearly no states with energies between these two extremal states are observed. Another remarkable 
£^ | feature of proteins is that they reach their unique ground state in a time that is much smaller than the time needed to 

search the extremely large configurational space. This is commonly known as the Levinthal paradox, and is noteworthy 
^ \ because it is difficult to merge with the two-state thermodynamics of proteins. 

Research into the more specific aspects of the dynamics of protein folding is divided into several sub camps. For 
k> | example, there is a huge effort to study simplified models numerically using ideas originating from spin glass theory. 
The basic idea here is that the different possible conformations of a protein define an energy landscape containing 
multiple minima. Such a picture is supported by experimental evidence at very low temperatures and structures 
confined by potentials less than about 2kcal/mol [Q. However, if the dynamics of proteins at normal temperatures 
were that of spin glasses, its folding would be even slower than a random search in configuration space, resulting in 
time scales on an astronomical scale. This is not the case. Today, one imagines two possible ways to explain how 
proteins can fold on reasonable timescales: (1) The existence of folding funnels or (2) the existence of folding 

pathways [||^] . The basic idea behind the folding funnel is that it is a deformation of the energy landscape in such a 
way that it resembles a funnel. The protein then "falls" into the global ground state without getting stuck. 

The folding pathway is a concept related to a specific sequence of events. In contrast to a funnel, a pathway is 
characterized by subsequent formation of identifiable structures that must be formed in a specific order. Pathway 
studies of proteins that fold have been extensively carried out by Fersht jl0| . Occasionally one observes more than 
one possible pathway. The mechanism leading to the existence of guiding pathways may be hierarchical from small 
scales to larger scales in the sense of Baldwin and Rose |||)] , or it may be purely sequential as suggested in [jllj . It 
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is the aim of this paper to demonstrate a third possibility, that encompass both of the above suggested scenarios but 
do it in a way that ensure the experimentally observed two state nature of protein folding-unfolding. 

In the next section, we introduce and discuss our parametrization scheme for protein folding. This is based on 
our earlier work on one-pathway protein folding [pl]-^3[. In order for a protein to be biologically relevant, certain 
demands has to be put on its folding properties. One such demand is that the folding-unfolding transition is sharp. 



The sharpest possible transition is obtained when the system is two state. We analyse in Section III a class of 



Hamiltonians parametrized by the scheme outlined in Section II B with respect to sharpness using the van't Hoff 
relation. We find that the typical system within this class behaves as a two-state system with maximal sharpness of 
transition. In Section |lV| we finally demonstrate that the class of Hierarchical folders encoded by the parametrization 
scheme of Section [n] have folding times compatible with the ones that are experimentally observed. Further we 
demonstrate that the phase transition is sharpened by an entropic barrier and therefore that the folding time increase 
strongly with temperature T in some T interval below the equilibrium transition temperature T c . In Section |v| we 
conclude that both demands, the sharpness of transition and the short folding times, are encoded by sequentially 
encoded hierarchical folders. 

II. PATHWAYS 

We take the point of view that proteins follow certain pathways when folding from the denaturated to the native 
state. The "standard" theorist's way of envisionin such a pathway is as a curve in some rugged energy landscape 
leading to the global minimum point — which corresponds to the native state. In the present work, we take a different 
point of view. We define a pathway as a sequence of partially folded structures which sequentially reduces both energy 
and entropy of the protein, and thus guide it towards the native state. By a partially folded structure we mean a 
protein conformation where parts of it has taken on the structure of the native state. 

The simplest model of subsequent entropy pruning is a zipper ]l4| , where one at each step k of the folding gains one 
unit of energy, say E[n + 1) = E(n) — a, and simultaneously reduces the available number of conformational states 
with a factor g, i.e., state space S(k) — ► S(k + 1) = S(k)/g. If each step is associated with binding of one amino 
acid from a random coil then g ~ 6 corresponding to the possible orientations of one amino acid along the chain . 
If we deal with reduction of effective number of orientations form a compact molten globule, then g ~ 6/e ~ 2 where 
the factor e comes from excluded volume effects ]l5| |. For individual amino acid contacts the corresponding energy 
gains may be estimated from the Miyazawa-Jernigan matrix Jl6[ , suggesting energies a of order 1 — > 2 kcal/mol. In 
the first subsection we will explore the mathematical formulation of this simple zipper model, and demonstrate that 
it indeed predicts a first order phase transition, albeit a fairly weak one. 

Folding of real proteins are presumably more complicated than a simple zipper. Protein structure data collected 
by Baldwin and Rose |||| support that the native state of real proteins is reached from progressively larger parts of 
the protein which are stabilized by the smaller structures already in place. This indeed suggest a sequential ordering 
of events with some similarity to the zipper, but with the important difference that folding may initiate at different 
places along the amino acid chain simultaneously. If this was the case, the resulting independence of events would 
imply a factorization of the partition function into sub-parts, with the result of weakening or eventually destroying 



the phase transition altogether. As discussed in Section III, the melting phase transition of a single domain protein 



is remarkably sharp, implying that folding the protein is a very cooperative and correlated process. 



We will in Section III see that the observed sharpness of the phase transition of proteins puts severe constraint on 
any protein folding model, e.g. ruling out both the simplest version of the zipper model and the simplest version of 
the hierarchy model. In addition, there is growing evidence that proteins may follow alternate pathways during the 
folding process |,|,[i]]-[l9|, thus requiring a formalism that merge the conflicting interest between the need of dealing 
with more than one pathway, to fold in a reasonable time, and to maintain the native state sharply folded when it 
finally is reached. 

The purpose of this section is to develop such a formalism, thus encompassing both the pathway formalism and the 
possibility that folding proceed hierarchically from smaller to larger structures. We first assume that there is only 
one pathway. This will allow us to introduce the central ideas in our approach in the next subsection. In subsection 



[IB we subsequently generalize theses ideas into a scenario where several pathways are allowed. 
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A. Single Pathway 



A folding pathway is a sequence of partially folded structures. Each structure along the pathway may contain the 
previously folded structures as a subset. If there is only one pathway available for the protein, this will be the case. 
We are therefore dealing with a hierarchy where structures use previously folded structures as building blocks. It is 
also clear that the folding process towards the native state proceeds from smaller to larger scales. 

We number the partially folded structures as they appear in the folding process, i = 1, • • • , N. The binary variable 
ipi equals 1 if partially folded structure i exists — also if it forms part of an even larger folded structure. Otherwise, 
it is zero. Thus, the folding pathway may be parametrized by the sequence of inequalities 

i>i > i>2 > ^3 > • • • > IpN ■ (1) 

We will in the following refer to the ipi variables as folding variables. 

There is energy and entropy associated with the partially folded structures. There are the direct binding energies 
that keeps the folded structures together. Perturbing the folded parts of the protein requires more energy than 
perturbing the unfolded parts. The latter may change conformation with very low or no energy cost. Thus, these 
changes appear as entropy on the thermodynamical level. In order to model these mechanisms, we introduce a second 
set of variables, the structural variables £i to £at. These variables contain e.g. the angles <fi and tp associated with 
N — C a and the C a — C bonds of the part of the protein that folded. However, they are simplifications — and to 
simplify even further, we assume that they only take two values [H, 



^ = { 1 - S . (2) 



This latter simplification is not essential to retain an analytically tractable model. However, it makes it easier to 
introduce the model and discuss it in general terms. The choice of two states corresponds to a degeneracy factor 
.9 = 2, and one can easily generalize the results we will obtain to larger g values by simply assuming that g — 1 states 
are associated with the higher energy level. 

We now proceed to construct the simplest possible Hamiltonian that reflect the above discussion p3[ . It is 

JV 

H=-J2a (i) m, (3) 

i=l 

where ay' are positive coupling constants. 

When the protein is at stage j in the folding process, the energy associated with it is 

i=l 

The parts of the protein not yet folded provide entropy, which in this particular case consists of the variables 
' " " j £iv- The degeneracy of this protein configuration is 2 N ~ : > . 

Working with the Hamiltonian Eq. (||) with the constraints ([!]) is somewhat cumbersome. We will therefore make a 
coordinate transformation that builds the constraints into the Hamiltonian directly. As we will see in the next section, 
this transformation allows us to generalize the one-pathway Hamiltonian presented in this section to multiple-pathway 
Hamiltonians: Specify a set of pathways, and the corresponding Hamiltonian can be written down immediately. 

We assume in the following that the value 3 defined in Eq. (|2|) is so large that the &'s for which ipi = X, will not 
take on this value for any realistic temperature (T < 100 C). In this case, we may set 

ipi = <Ai , 

Ipi = (j>\<t>2 ■ ■ ■ (f>i , 
IpN = 4>l4>2 ■ ■ ■ (pi' ■ ■ 4>N , 

(5) 
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where there are no constraints on the new variables fa. We see that the constraints Eq. (|lj) automatically are obeyed. 
The Hamiltonian (g) now becomes 

H = -a {l Vi - a^fafa a^fa ■ • ■ <t>N , (6) 

We furthermore note that the degeneracy provided by the £j variables (as they may take on two possible values, 1 
and 1 — H, when the corresponding tpi is zero), is also build into the Hamiltonian. Assume e.g. that all fa to fa; are 
equal to one, this corresponds to tpi = • • • tpj = 1. Furthermore assume that 4>j+i — tpj+i = 0. The remaining faj+2 
to fa\ can take on any value without change in energy. Thus, to within a factor 2, the degeneracy is the same that 
provided by the £j variables in the original Hamiltonian (g). 

The Hamiltonian ([|) has a first order transition at a temperature T = f/log2 when aS 1 ' = ■ ■ ■ = a^ N ' = 1 [ pT| . 
This Hamiltonian is easily generalized to take into account the coupling between water and protein Ea] . 



B. Multiple Pathways 

We need to define what we mean by "multiple pathways." We return to the single pathway defined using the folding 



variables by Eq. (II A). Suppose we break this sequence of inequalities by removing one of them, say tpi > tpi+i, so 
that we have 

> • • • > 1>i , (7) 

and 

ipi+i > ■ ■ ■ > i>N ■ (8) 

Now, the status (0 or 1) of any ipj, where j ; < i is independent of the status of any ipk, where k > i. That is, we have 
created two independent folding "domains." 

In terms of the fa variables, the two groups of inequalities (0) and may be expressed as 

■01 = fa , 

Ipi = fa ■ ■ ■ fa , 



(9) 



and 

i/ji+i = fa+i , 
ipN = fa+i ■ ■ ■ fay 



(10) 



We may also construct the corresponding Hamiltonian, 

U = -a^fa - a^fafa a^fa ---fa- a^fa+i a^fa+i ■ • • <pN ■ (U) 

When setting up the Hamiltonian (|ll|), we have made the assumption that both pathways, 1 — * 2 — ► • • • — * i and 
(i + 1) JV have the same degeneracies associated with them, fixing the relative entropic contribution from 

each branch. By introducing additional independent variables fay+j, we may tailor these contributions. For example, 
the Hamiltonian 

H = -a^fa - a^fafa a^fa ---fa- a^fa+ifav+i a^fa+i ■ ■ • 4>n4>n+i (12) 

will increase the entropic contribution of the second branch by Tlog2 compared to the first branch. 

We may generalize the ideas presented above to systems with multiple folding pathways. This is done simply by 
generating telescoping groups of fa variables, one for each pathway. The corresponding Hamiltonian may be written 
down directly, 
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n = - E 4Ui - E a ?jMi - ■ • • - E « { &.Mm ■■■■ (is) 

i i^j ijtjjik... 

We have here added subscripts to the coefficients, reflecting which 0j variables are involved in the corresponding 
term in the Hamiltonian. 

In order to construct a Hamiltonian given a set of pathways, we start with organizing the pathways in a branching 
structure where each numbering refer to a folded substructure of the protein, as shown in Fig. [lj. The interpretation 
of this figure is that folding of structures 1 and 5 are independent starting points of the folding process. In order to 
structure 4 to be formed, structure 5 must already be in place. However, this structure is independent of structure 
1 having been formed. Structure 2, on the other hand, needs both structures 1 and 5 in place. Lastly, structure 3 
can only form if both structures 2 and 4 have formed. In the end, all structures must always be formed. A concrete 
example of such a set of pathways has been reported for staphylococcal nuclease . 

Assuming now that the reduction in degeneracy when either structure 1 or 5 form is the same, we may set up a 
"folding table" as shown in Table |. From this table, we read off the Hamiltonian 

U = -Of Vl - af'4>5 - 045 - 0125010205 ~ 0x23450102030405 ■ (14) 

In the following, we will study the thermodynamical and dynamical properties of Hamiltonians constructed in 
this way. There are two main questions to be addressed which are essential in connection with protein folding: (1) 
sharpness of folding transition and (2) shortness of folding time. In the next section, we will consider the sharpness 
of the transition. 



III. SHARPNESS OF TRANSITION 

An excellent measure of sharpness of a first-order transition is to study the van't Hoff relation |^2|,^3| which relates 
enthalpy change through the transition AH, the peak of the heat capacity C p , the absorbed heat, Q through the 
formula 

AH = a T - C ^ Tc) . (15) 

Here T c is the transition temperature, and a is the van't Hoff coefficient. The smaller this coefficient is, the sharper 
the transition. 

We may demonstrate this in the following way. In the present case, AH = Q (as there is no pressure in the 
system), which is the area of the bump in the heat capacity plot. The height of the hump, h, is C p , while the area Q 
is essentially the width of the top b multiplied by the height, h, Q — bh. We may thus write the van't Hoff equation 

b 2 h = all . (16) 

^From this equation it might seem that the opposite of what we are claiming is true: The larger the a, the wider the 
transition, as the height h is proportional to a. However, one must not forget the presence of the width b. We rewrite 
the left hand side of Eq. ( |l6| ) as 

{bhf 1 = aT c 2 . (17) 

Now keeping the area of the hump, bh = Q, constant, we see that the height h is inversely proportional to a. Hence, 
our statement that a smaller a implies a sharper transition follows, as sharpness is a relative concept. 

In order to understand the the a coefficient properly, we examine its physical meaning in the following. Let us 
assume a system with an equally spaced energy spectrum, l,...,iV. We furthermore assume a degeneracy g n for the 
nth energy level. The partition function is then 

Z = Y J 9ne n,T . (18) 
n=0 

For proteins there should be a unique ground state, so that gjy = 1. The energy is given by 
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E(T) = T 2 -^ log Z = -(n) , (19) 

and the heat capacity is given by 

C(T) = ^E(T) = i^M! . (20) 

These relations are of course valid at any temperature. The a factor can now be expressed in terms of the fluctuations 
of the energy levels at T = T c , 

N 2 

(n 2) _ {n f = _ . (21) 

a 

For any probability distribution we must have (n 2 ) — (n) 2 < N 2 /4. We only have equality when P(n = 0) = P(n = 
N) = 1/2. Thus in general 

a > 4 . (22) 

Equality a = 4 is only found when the system exhibits two state behaviour, i.e., when go = 9n^ N ^ T and otherwise 
g n = 0. This is fulfilled with the Hamiltonian TL — —N<pi ■ ■ -(f>N, i.e., when all terms aW = for all i < N in Eq. 
(|iT|). In general Eq. (^lj) can be reformulated as 

<s 2 ) - <s> 2 = - ■ (23) 
a 

which express a as the variance of the distribution of normalized energy states (s = n/N) at the transition point. 
This distribution could be bimodal, (a = 4), be flat (a = 12) or be centered around a mean intermediate energy 
(a > 12). The flat distribution is obtained when all aW = const in the Hamiltonian (|TT[). Of special interest is the 
class of Hamiltonians that correspond to the hierarchical folding suggested by Baldwin and Rose g^]. We show in 
Fig. H an example of such a hierarchical folding network with four levels. In this scheme the folding of smaller units 
form a necessary but not sufficient condition for the folding of larger units. A corresponding Hamiltonian may be 



constructed as outlined in Section II B , and we find 



H = -a^Vl - a^4>2 - «4 03 ~ a i 04 - 0125010205 ~ 446030406 ~ 0^3456701020304050607 • (24) 

At each level in the hierarchy a new variable is needed in order to make folding at level below a necessary but sufficient 
condition for folding at this level. 

As the number of terms in this hierarchical Hamiltonian grows, a — > oo. The reason for this is the statistical 
broadening due to the exponentially growing number of independent branches when moving "backwards" in the 
hierarchy towards smaller units. 

Privalov and Khechinasvili [^2| measured experimentally the value 

a = 4.2 ±0.2 . (25) 

for large group of small globular proteins. Thus real single domain proteins are close to two-state behaviour, i.e., far 
from a simply guided Hamiltonian, and very far from protein folding viewed as subsequent folding of independent 
subunits. Folding of single domain proteins, that in practical terms means proteins of about 100 amino acids is indeed 
close to a maximally cooperative process. 



A. A Class of Hierarchical Hamiltonians with a — 4. 



Based on spatial correlations of amino acids in real proteins, Baldwin and Rose argue convincingly that some sort 
of subsequent folding hierarchy must be implemented in real protein folding |^,^|. In last section we saw that the 
simplest hierarchical scheme is excluded for single domain proteins. We now explore a more elaborate hierarchical 
scheme, where correlations between folding subunits naturally becomes implemented on an early level. The simplest 
of these is the single pathway model of Eq. (j^. A general formulation is the Hamiltonian (|l3|). It has up to N 
groups of products of various length of <f>i variables. We will in next subsection investigate the broad subclass of these 
Hamiltonians which are restricted by having maximum one term of each product number, and see that they typically 
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have a w 4. This means that we investigate Hamiltonians as in Eq. ( J13| ) where the coefficient = 1 for one 

combination of ijk . . . and otherwise zero. Thus, there is one term with only one <f>i, one term with two <pi variables 
etc. 

We note that the folding system in Fig. [j] as described by Eq. ( |l4| ) does not include only one term at each level. 
However, the system in Fig. [l] can also be parametrized by introducing one additional auxiliary variable 06, thereby 
creating an entropy barrier in the thermodynamics. The corresponding Hamiltonian for Fig. [l] would then be 

H = -ap0i - 4 1)( ^506 - a456040506 - 0^5601020806 ~ <423 456 01 0203040506 ■ (26) 

This is the type of Hamiltonians that we wish to study. This subclass deals with systems where indeed different 
regions may start independent folding, but where some foldings are more difficult to initiate and thus becomes rate 
limiting. In other words, that different events takes very different times implies that some folding events acts as 
effective nucleation centers. When these barriers are overcome, subsequent foldings spread across the system along 
some fairly well defined paths. 

The nucleation barriers were associated to cases where several variables must fold simultaneously without previous 
guiding (05 and 06 in the above Hamiltonian (p6|)). In addition, different folding paths typically become correlated 
e.g. through a common origin, as in the above Hamiltonian where the events 05 and 0g both are needed in order to 
proceed folding along the parallel ways parametrized by respectively 02 and 04. Such correlation could e.g. be due 
to steric interactions that facilitate folding by bringing far-apart amino acids closer to each other. These associated 
entropy barriers and correlations are responsible for the effective two state behaviour that this class of hierarchical 
models exhibit. We will examine this in detail in the next section. 



B. Averaging over the Class of Hamiltonians 



We will in following show that the type of Hamiltonians discussed in the previous section displays remarkable 
thermodynamic properties. We shall consider more specificly the class of Hamiltonians defined by Eq. (|l3|), where 
there is only one term for each number of 0jS. The partition function for a Hamiltonians belonging to this class is 



z = J2°~ n/T 



(27) 



To gain insight into the thermodynamics of this class of Hamiltonians, we have calculated the mean of the partition 
function over all possible Hamiltonians. The result is 



(Z) 



N 

E 



n 

fe=i 



sl(N-k)l 1/T 



Nl(s - k)\ 



1) 



(28) 



The calculation is outlined in Appendix |A|. This gives us the thermodynamics of an average Hamiltonian. In order to 
find out if this is a representative result we have to calculate the fluctuations around this mean. All thermodynamics 
properties of a given Hamiltonian is derived from log Z, and the mean can be expressed as (log Z) = (log z) + log (Z) , 
with z = Zj (Z). We have in Appendix [b| transformed the expression for the fluctuations of the partition function into 
the function, Eq. (B6) and the result is plotted in Fig. ||. The fluctuations are extremely small, in fact (z 2 ) — (z) 2 — > 
as the system size increases. For example for for N ~ 100 they contribute ~ 10 -4 fcBT to the free energy, and 
hence without any physical relevance. In other words two randomly chosen Hamiltonians will have exactly the same 



thermodynamic properties, the system is self- averaging and Eq. (28) is the exact solution of the partition function. 

In order to study the folding transition, we define an order parameter, u, by the mean of the sum over all the <j 
variables 



N 



N 

£< 

j=i 



(29) 



When the protein is in the ground state (0,) = 1 and the order parameter is u = 1, at high temperatures (<j>i) = 1/2 
and u = 0. Since the class of hamiltonians that we study are self-averaging for large N, we may calculate the thermal 
mean values by taking the average over all Hamiltonians. We may thus write 
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£<^> = i£££^- w/T /^> 



M ■ 

{a} {0} 



fc=i 



ri(jy-fc)i 1/r 

TV!(s-fc)P j 



(30) 



where M is the total number of Hamiltonians. Fig. |] shows the order parameter at different temperatures. We clearly 
see that there is sharp first-order transition around the temperature T c ~ f .44. The sharpness of the transition is 
characterized by the van't Hoff a coefficient, which is calculated from the partition function Eq. (p8|). The results 
plotted in Fig. || show that it goes quickly to a — 4 for large TV, which is the lowest possible value, i.e. the sharpest 
possible transition. We conclude hence that the folding scheme we have introduced gives a first-order phase transition 
with a van't Hoff coefficient of a ~ 4, corresponding to a two-state thermodynamics. 

We note that for the single-path Hamiltonian, Eq. (^|) with all = 1, a equals 12. However, if the last term is 
set equal to a^ N ^ = eN, where e — > is small (and goes to zero as N in the limit N — > oo), then a w 4 p4| . 

IV. FOLDING TIME 

Proteins in their native form have well defined structures that are separated from their denatured counterparts 
through some barrier, as quantified by value of a. The denaturated state consists of an astronomical numbers of 
microscopic conformations, whereas the native state is unique. The fact that the unique native state anyway is 
reached fairly fast from this large number of unfolded configurations is known as the Levinthal paradox. Thus, 
the folding proteins must have some property encoded which lower configurational entropy systematically as folding 
proceeds. However, such guided folding has to be merged with the fact that the folding process exhibits effectively 
two-state behaviour, i.e. has few or no detectable intermediates |2j|, respectively a ~ 4 p3] |. 

Minimalization of folding time and the van't Hoff parameter a poses seemingly contradictory goals. This one might 
expect because absence of significant intermediates restricts the possibility of guided folding between the denatured 
states and the native state. As an example the following Hamiltonian has a = 4, i.e., a maximally sharp transition 

m, 

H = —N 0i02 • • • 4>n • (31) 

However, it has absolutely no guiding at all: When in the denaturated state, i.e., when at least one 0, = 0, there is no 
energy gain in turning this to </>j = 1, unless all other variables <fij already are equal to one. There is no energy gain 
unless such a protein moves into the native state. Searching for the native state in this system will grow exponentially 
with system size N. In general, it may seem that the more "two state" the transition is, the less guiding can be 
involved, and hence, the longer the folding time will be. 

In order to discuss folding times, we need an operational definition of folding time. We base our definition on the 
one-step Monte Carlo method pq] . The Monte Carlo method is a way to generate a biased random walk through 
configuration space such that the relative frequence of occupancy of any configuration is proportional to the Boltzmann 
factor. Thus Monte Carlo "time" , measuring the number of steps of the random walk, has strictly speaking nothing 
to do with "real" time. However, in a discrete system, such as our model, we believe that time defined in this way, 
essentially brings us as close as possible to time in a corresponding continuous system. Hence, we will base our 
definition of time in the model by one-step Monte Carlo. Hence, we define one unit of time as the times where 
all variables have been updated once in average. This definition of time is common when dealing with dynamical 
questions concerning discrete system p6| . 

The non-guided Hamiltonian ( pl| ) will fold in a time proportional to 2 N /N when starting from a random initial 
configuration. The fully guided (zipper) Hamiltonian (Q) will similarly fold in a time proportional to N . The Baldwin- 
Rose picture of hierarchical folding ||^| will give a folding time proportional to log N. These examples, as noted above, 
seem to suggest that Hamiltonians with larger a fold faster. However real single-domain proteins have a close to its 
minimum value 4, and are still able to fold in fairly short time. 



We have in Section [III A| suggested a class of hierarchical folders, and seen that these typically have a ~ 4 that is 
compatible with two-state behaviour seen in experiments. 

In order to examine the folding behavior, we restrict ourselves initially to the case of zero temperature, i.e., when a 
term has folded it never opens again. The fact that we at all can reach the ground state at zero temperature follows 
from all terms in the Hamiltonian being attractive (i.e., negative signs) and that the various terms in the Hamiltonian 
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guide the system toward the uniquely defined ground state (where all <j>i = 1). Thus, there are no energy barriers in 
our idealized model system, and folding is therefore only limited by degeneracy/entropy barriers. 

The zero temperature limit makes it possible to characterize the folding as function of number of subsequent folding 
steps, where one step is defined as an event where at least one term in the Hamiltonian becomes non-zero. Fig. ^a 
shows the time to fold subsequent steps of the Hamiltonian for a system of size N = 100. The (+) and (x) symbols 
denote trajectories of two different Hamiltonians of type ([l3]), whereas the full line show the ensemble averaged folding 
time (over all Hamiltonians and trajectories). We note that although the trajectories typically fold subsequent levels 
of the Hamiltonian, this is not necessarily the case, thus opening for alternate pathways. 

Entropy barriers are caused by the necessity to fold several 4>% terms simultaneously, and the chance to do this 
decreases as the l/2 fc , where k is the number of terms. In Fig. ^a we display the entropy barrier dS/dk defined as 
the number of simultaneous folding terms versus the folding step k for the N — 100 system. The histogram is dS/dk 
for one particular Hamiltonian, whereas the dashed line shows the barriers for the ensemble averaged values. Fig. 
Qb displays a data collapse of dS/dk for three different system sizes, demonstrating that both the position and the 
maximal barrier scale with system size approximately as y/~N, which implies that the folding time grows with systems 
size as 2^ . An argument for the obtained scaling is that after k folding steps, then as long as k is small, one typically 
has folded k 2 /2 terms. When fc 2 /2 becomes comparable to N, overlap between subsequent steps becomes significant, 
and subsequent folding involves fewer new folding variables and therefore folding becomes easier. 

In order to estimate whether these numbers are consistent with a fine grained behavior of real proteins then the 
degeneracy of each degree of freedom should be of the order of 6, corresponding to the orientational possibilities of 
a single amino acid on a random peptide chain. Thus exchanging 2 — > 6 in the above analysis, and noting that 
individual amino acids change conformations on a timescale of the order of nanoseconds Q , we very roughly estimate 
a folding time of 6^ x 10~ 9 s ~ 10 _1 s for N = 100 which is of the order found for proteins. 

Fig. ||a examines the zero temperature energy as function of folding step for a specific N = 40 system, where we 
associate one unit of energy to each contact term in the Hamiltonian. Thus the coefficients = 1, implying that 
all Hamiltonians of type ( |l3| ) have equilibrium transition temperature T c = l/log2. The figure shows that the first 
k ~ 12 steps is associated with a slow linear decrease in energy, hereafter a few subsequent steps is responsible for 
~ 65% of the total energy gain due to folding. Combining this with the fact that nearly all entropy reduction has to 
take place for k < 12 (see Fig. ^), we conclude that entropy is reduced before energy is gained. 

We now examine the effect of finite temperature where the increased importance of entropy converts entropy barriers 
into free energy barriers. This makes the initial folding steps thermodynamically disfavoured at temperatures below 
the equilibrium melting temperature T c . In Fig. ||b this is examined quantitatively, where free energy is shown versus 
an effective order parameter u* . defined as the number of tp variable which is equal to one and belongs to a term in 
the Hamiltonian which contributes a finite amount (i.e. E = 1) to the energy. Thus u* is linearly decreasing with 
the residual entropy of the system. The plot show the free energy profile for three temperatures, the T = case, the 
temperature where folding becomes dynamically suppressed, T = T* = 0.27T C (for N — 40), and the equilibrium 
denaturation temperature (T c = 1/ log 2) where the ground state just balances the denatured state. We conclude that 
in our simplified scenario, the sharpness of the transition at T c arises because of an entropy barrier that will make 
folding much slower in some limited interval below the equilibrium melting temperature. With the parameters of the 
figure one first obtains the fairly fast folding examined in Fig. ^ when the temperature becomes below T* ~ 0.27T C . 

We stress that one in principle could include energy barriers into our formalism by adding positive terms to the 
hamiltonian (i.e. —Qyl > in Eq. Jl3|)). Such terms could arise either if the denatured state has binding contacts 
which are not in the native state or if the transition state have non native contacts. Here we focussed on the possibility 
for a transition that is purely limited by entropy barriers, thereby defining a transition state which has little entropy 
and large enthalpy difference to the ground state, a feature observed in for a— spectrin SH3 by |27|. As a consequence 
we predict a temperature dependence that is opposite to recent measurement on protein folding kinetics, where 
typically the folding time decrease with temperature T for T <^ 40°C |27-29|. Energy barriers therefore plays a 
role at least at these lower temperatures. Future measurement of folding time behaviour close to the denaturation 
temperature could teach us whether the two-state behaviour found for folding of small proteins is due primarily to 
enthalpic or entropic barriers. 



V. CONCLUSION 



In this paper we have explored the idea that entropy correlations and barriers may appear at any step during 
a sequential folding process. We have seen that this is natural to expect as consequence of a hierarchical folding 
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process where different branches of the hierarchical folding structure are correlated. We have showed that this folding 
scheme can result in two-state thermodynamics even though the energy gain is microscopically small at each folding 
step. We have seen that this two state thermodynamics can be merged with a fairly fast folding process, with time 
increasing with A as 2^ instead of 2 N as one naively should expect for two-state folders. Further we have found 
the qualitatively same subdivision between initial and final folding as reported in p4j , without the need to resorting 
to a special treatment of any terms. This subdivision into early entropically disfavoured guiding and last step easy 
match predicts a folding time that increases with temperature when denaturation is approached. 

We thank A. Bakk and J. S. H0ye for many fruitful discussions. P. G. D. thanks the NFR for financial support. P. 
G. D. and A. H. thank Nordita and the Niels Bohr Institute for friendly hospitality and support. 



APPENDIX A: CALCULATING THE AVERAGE PARTITION FUNCTION 



We can make N\/m\ different products of m fa. Thus, there is in total 

AM A! A! 

M = 



(A - 1)! (A -2)! "' (A- A)! 

different Hamiltonians (note, however, that most of them are equal up to permutations of fa). 
We may calculate the mean value of the partition function 



(Al) 



{a} {4>} 

1 

M 



{4>} i i¥=j «W#A-" 



i>j<fy k .../T 



(A2) 



The Hamiltonian is invariant to all permutations of fa and this suggests the new variable s — 5Z i=1 fa- We then find 



E e0i/T 



l/T 



A! 



(s-1)! (A-l)! (s-1)! ' 



and 



5>* 



4> 3 /T 



l/T 



A! 



(s-2)! (A -2)! (s-2)! 



and so on. This gives us an expression for the mean partition function 

N 

M 



3 1/T 



A! 



s=l x ' k=l 

This expression can be further simplified to 

N 



(s-k)\ (A-fc)! (s-jfe)! 



A! 



N 



( z )=i+E(»in 



A!(s- fc)! 



(A3) 



(A4) 



(A5) 



(A6) 



APPENDIX B: FLUCTUATIONS OF PARTITION FUNCTION 



In this Appendix, we find the expression for (Z 2 ) which was used to show that the relative fluctuations {(Z 2 ) 
(Z) 2 ) / (Z) 2 goes to zero as A —> oo, as is seen in Fig. [| We have that 



M 

{a} {0} {0'} 

EE G (^')' ( B1 ) 

m {0'} 
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where 



G(<t>, </>') = ^2 Yl XI e (0l+0 « )/T e (< ^' +< ^ )/T ... ^ e (^^^-+0>-K-)/r _ ( B2 ) 

{0} {<t>'} * i^j^fc- 

The sum over <j> and <j>' is simplified by noting that the function G (</>,<//) is symmetric to all permutations of pairs 
(<f>i,<f>i)- This means that G can be expressed as 



G(<j>,<j> , ) = G(s,s',p), 



(B3) 



with s = J2iLi 4>ii s ' — J2iLi 4>'% an d V — J2"=i 4>i$i- The sum over all (pi and is thus reduced to a sum over s, s' 
and p. 



N s 



S=0 p=p m i, 

AT s-1 s' 



N\ ( N-p\ ( N-s 
P J \ s-p J \ s-p 



G(s,s,p) 



= 1 S '=0p=Pn 



AT \ f N-p\ (N-s 



p J \ s — p I \ s-p 



G(s,s',p) , 



(B4) 



where p m ; n = (s + s' — N)Q(s + s' — N). The function G(s,s',p) is a product of TV sums. Each sum can be be 
evaluated. The results is 



N 



G(s,s',p) = Y[ A m (s,s',p) 



(B5) 



m— 1 



where we have that 



A m (s,s',p) = 0(p-m) 



& P 2/T 
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+ 

+ 
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(p — m)! 



3 1/T 



(s' - m)! (p — my. 



+ Q(p-m)- 



(N — to)! (p — to)! 

Here 6(n) is the step function (6(n) = 1 for n > 0). 
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FIG. 1. Folding pathways shown as a directed network between partially folded structures, which have been numbered. 



FIG. 2. Folding pathway network corresponding to the hierarchical folding scheme described by Baldwin and Rose. 

FIG. 3. Relative fluctuations of partition function for different temperatures as a function of system size N. This shows that 
the system is self averaging strongly enough, so that \og(Z) — (log Z). 



FIG. 4. Order parameter u for N = 50, 100, 200 as a function of temperature T. 



FIG. 5. a coefficient for different system sizes N. 
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FIG. 6. a) Folding trajectories of N = 100 systems. We show time since denatured state as function of folding step k. If all 
terms in Hamiltonian are assigned equal energy a = 1, each folding step is associated by one unit of energy reduction. The (+) 
symbols denote three trajectories for one Hamiltonian, the (x) symbols show three trajectories for another Hamiltonian. The 
full line shows the ensemble averaged folding trajectory. One observes that full folding is reached after about 3 • 10 timesteps, 
and that it only require the first k ~ 20 terms in the Hamiltonian to be explicitly folded. 

FIG. 7. a) Folding barrier as function of folding step for the N = 100 system. The largest folding barrier is caused by the 
maximum number of 0- variables that must be set to one without guiding in order to make a term in the Hamiltonian non-zero. 
The (+) symbols illustrate a case where this barrier involves folding of 8 variables at folding step k = 9. b) Rescaled folding 
barriers ({N / 100) 1 ' ' 2 dS/dk) versus rescaled folding step ((iV/100) 1/2 fc) for three different system sizes (N = 25, 100 and 400). 

FIG. 8. a) Example of energy as function of folding step k for a system of N — 40 folding variables and all terms in 
Hamiltonian assigned a prefactor a' 1 - 1 = 1. Note the distinct difference between early and late folding regime, contrasting the 
linear decrease found for the simple zipper model, b) Free energy versus effective order parameter (defined in Eq. (p9[)) for 
three different temperature. Note that the monotonic decrease at low temperature is replaced with a free-energy barrier at 
intermediate temperatures. 
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TABLE I. Folding table for the pathway network shown in Fig. [l|. The columns refer to the variables 0j, while the rows 
refer to products of 0j variables. The left-most column refers to the number of 0^s in the products. The numbers to the right 
of the left-most column refer to the five structures in Fig. pj. 
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